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Abstract 

We study semigroups of labellings associated to a graph. These generahze the Jukes-Cantor model 
and phylogenetic toric varieties defined in [Buczf 2] . Our main theorem bounds the degree of the 
generators of the semigroup hy g + 1 when the graph has first Betti number g. Also, we provide a series 
of examples where the bound is sharp. 
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1 Introduction 

Throughout the article Q is a non-oriented graph. We study a subset t{Q) of the set of all labellings of 
edges of Q by integers. It has a natural structure of a graded semigroup with edge- wise addition (see ^for 
the definition). We call it the phylogenetic semigroup of Q, since the conditions on the labels come 
from phylogenetics. The first named author studied it in ^Buczl2j as a generalisation of the polytope 
defining the Cavender-Farris-Neyman [NeymTl] model of a trivalent phylogenetic tree. This model was 
studied in many papers and is often called the 2-state statistical Jukes-Cantor model |BW07j . |SS05j . 
[PS05] . and |Buczl2P . This is the simplest group-based model. Hence the associated algebraic variety 
is a toric variety, see jSSOSj . and it is the projective spectrum of C[r(^)]. Its equations are calculated in 
[SS05| . and its geometric properties are examined in |BW07j . 

More recently Sturmfels and Xu [SXlOj proved that given the number of leaves n, the Jukes-Cantor 
model of a trivalent tree is a sagbi degeneration of the projective spectrum of the Cox ring of the blow-up 
of P"~3 jj-^ points. This variety is closely related to the moduli space of rank 2 quasi-parabolic vector 
bundles on with n marked points. 



'^We thank Elizabeth Allman for bringing the original name of this model to our attention. 
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Further work in this direction was done by Manon in |Mano09] and |Manoll] . He used a sheaf of 
algebras over moduli spaces of genus g curves with n marked points coming from the conformal field 
theory. The case 5 = is the construction of |SX10j . thus Manon's work generalises the Sturmfels-Xu 
construction. The semigroup algebras C[r(t7)] are the toric deformations of the algebras over the most 
special points in the moduli of curves in Manon's construction. Here G is the dual graph of the reducible 
curve represented by the special point. 

Jeffrey and Weitsmann in |JW92] studied the moduli space of flat S'?7(2)-connections on a genus g 
Riemann surface. In their context a trivalent graph G describes the geometry of the compact surface of 
genus g with n marked points. They considered a subset of Z-labellings of the graph, which is exactly 
TiG)d, the d-th graded piece of t{Q). They proved that the number of elements in this set is equal to the 
number of Bohr-Sommerfeld fibres associated to C^'^, where C is a natural polarising line bundle on the 
moduli space. The Bohr-Sommerfeld fibres are also the central object of study in | JW92j . By the Verlinde 
formula [Verl88j . [Falt94| . the number of these fibres equals the dimension of the space of holomorphic 
sections of C®'^. This number is the value of the Hilbert function of the toric model of a connected graph 
with first Betti number g and with n leaves (see |JW92l Thm 8.3] and compare the conditions |JW92t 
(8.2a-c)] with Lemma 14.41 below) . 

Thanks to the Verlinde formula, which arises from mathematical physics, the Hilbert function of the 
semigroup algebra C[t(^)] has significant meaning. In the case of trivalent trees it was used in |SX10| 
and studied by Sturmfels and Velasco in [SVIO] , One of the features of this model is that the Hilbert 
function depends only on the combinatorial data [BW07] . |Buczl2| . This phenomena fails to be true for 
other models, even group- based models |KubjlO| , [DBMinj . 

As a summary, three distinct areas of science lead to study of the same object of purely combinatorial 
nature: the phylogenetic semigroup of a graph. Firstly, it generalises Cavender-Farris-Neyman model of 
a phylogenetic tree. Secondly, it is related to the moduli spaces of quasi-parabolic vector bundles and 
moduli spaces of marked curves. Finally, conformal field theory is interested in enumerating elements of 
the semigroup. 

In this paper we are interested in the problem of determining the degrees of elements in the minimal 
set of generators of the semigroup t{G). Originally this problem was suggested to us by J. Wisniewski 
and B. Sturmfels in a simplified version, where Q is trivalent. Thanks to a suggestion of a referee we 
extended the results to arbitrary graphs. First, we prove an upper bound for the degree of the generators 
in terms of first Betti number g of the graph G, see Section [3] for the proof. 

Theorem 1.1. Let G be any graph with first Betti number g. Any minimal generator of t[G) has degree 
at most 5 + 1. 

This result has been obtained in several special cases: for trivalent trees i.e. g( = 0, in |BW07j . for 
arbitrary trees in [DBMIO] . for trivalent graphs with (7 = 1 in |Buczl2) . 

Our second result shows that the upper bound of Theorem 1 1.1 1 is attained for certain graphs. We prove 
the theorem in Section [5] (see Example 14. 9p . See also Propositions 16.11 and Propositions 16.21 for extending 
the examples to graphs which are not trivalent, and with no loops. 

Theorem 1.2. Let g he even. There exists a graph G with first Betti number g and an element uj € t(G) 
of degree g + 1 which cannot be written as a non-trivial sum of two elements lo = uj' + uj" for oj' , uj" G t(^). 
Specifically, G a g-caterpillar graph (see Figure\^, and uj the labelling in Figure^is such an example. 

When g is odd, for all trivalent graphs with first Betti number g = 1 the bound is attained, as proved 
in [Buczl2j . Also, there exist graphs with g = 3, such that the bound is sharp. The simplest of these 
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is the 3-caterpillar graph; we illustrate an indecomposable degree 4 element in Section [6l The odd case 
follows from the even case, i.e. Theorem II. 2 [ 



Corollary 1.3. Let g be odd. There exists a graph Q with first Betti number g, and an element uj € t(Q) 
of degree g which cannot be written as a non-trivial sum of two elements in t{Q). Specifically, Q can be 
taken as the g- caterpillar graph. 

It remains to address the case when g is an odd integer greater than 3. It is natural to expect that 
when Q is the ^(-caterpillar, then there exists an indecomposable element in t[Q) of degree 5 + 1. This, 
however, is false. 

Theorem 1.4. Suppose Q is the g-caterpillar graph and oj G t{Q) is an element of even degree d > 6. 
Then lo = to' + uj" for some non-zero u}',uj" € t{Q). 

In summary, the maximal degree of generators of the semigroup for the (/-caterpillar graph is as follows. 

Corollary 1.5. Let Q be a g-caterpillar graph. Then the semigroup t{Q) is generated in degree 



Contrary to the case of the g-caterpillar graph, the conclusion of Theorem 11.41 is false for some other 
graphs. In Section[6]we present an indecomposable element of degree 6 on a graph with first Betti number 
6. However, we do not know if there exist a graph Q with odd first Betti number g > 5 such that t(Q) 
has a minimal generator of degree 5 + 1. 

A complete description of the generators of t{Q) is known for trivalent trees |BW07| . for trivalent 
graphs with first Betti number 1 |Buczl2] . We conclude by presenting results of some computational 
experiments. Namely, we list all the generators of t{G) when 5 < 4, and enumerate these generators when 
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2 Semigroup associated with a graph 

In this section we generalize the construction of t{G) introduced for trivalent graphs in jBuczl2] . 

Definition 2.1. A graph Q is a set V = V{Q) of vertices and a set £ = £{G) of edges, which we identify 
with pairs of vertices. We allow G to have loops or parallel edges. A graph is trivalent if every vertex has 




g + 1, if g is even or g £ {1,3} 
g, if g is odd and > 5. 
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= 5. 
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valency one or three. A vertex with valency one is called a leaf and an edge incident to a leaf is called 
a leaf edge. A vertex that is not a leaf is called an inner vertex. The set of inner vertices is denoted 

N = M{g). 

A path is a sequence of pairwise distinct edges eo, . . . , with ei PI ej+i / for alH € {0, . . . , m — 1}, 
such that either both eo and Cm contain a leaf, or cq H 7^ 0. In the latter case, the path is called a 
cycle. A cycle of length one is a loop. A graph with no cycles is a tree. Two paths are disjoint if they 
have no common edge. A network is a union of pairwise disjoint paths. For consistency we say that the 
empty set is also a network. An edge which is contained in a cycle is called cycle edge. First Betti 
number of a graph is the minimal number of cuts that would make the graph into a tree. 

Remark 2.2. Given the origins of the problem it is tempting to say genus of the graph instead of first 
Betti number. However, this is inconsistent with the graph theory notation, where genus of a graph is 
the smallest genus of a surface such that the graph can be embedded into that surface. 

Definition 2.3. Given a graph Q let IjE = ^^^^Z-e be the lattice spanned by £, and ZiS^ = Hom(Z£', Z) 
be its dual. Elements of the lattice Zf are formal linear combinations of the edges, thus £ forms the 
standard basis of Z£. The dual lattice Zf"^ comes with the dual basis {e*}e££- We define 

M = {uGZ£\yveM ^e*(M)G2Z}. 

Then the graded lattice of the graph, with the degree map, is 

= Z e M, deg : M^'' = Z M ^ Z, 
given by the projection onto the first summand. 

Definition 2.4. Given a tree T the phylogenetic polytope P{T) on T is 

P(T) = conv{^ Oee G M : Oe € {0, 1}}. 

That is points in P{T) H M correspond to networks on T. The phylogenetic semigroup t(T) on T is 

t{T) = cone{{l} X P{T)} n M9^ 

The definition of the phylogenetic polytope on a tree corresponds to the definition of the polytope of the 
2-state Jukes-Cantor binary model in [Mich 11a] . and in a different language in |SS05] . The phylogenetic 
semigroup on a tree is the semigroup associated to the phylogenetic polytope. 

e e e 

Figure 1: One step in the procedure of "cutting" a graph. 

To a given graph Q with first Betti number g we associate a tree T with g distinguished pairs of 
leaf edges. This procedure can be described inductively on g. li g = 0, then the graph is a tree with 
no distinguished pairs of leaf edges. For > we choose a cycle edge e. We divide e into two edges 
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e and e, adding two vertices / and I of valency 1. The edges e and e form a distinguished pair of leaf 
edges (see Figure [T|). This procedure decreases the first Betti number by one and increases the number 
of distinguished pairs by one. Note that usually the resulting tree with distinguished pairs of leaf edges 
is not unique, however a tree with distinguished pairs of leaf edges encodes precisely one graph and the 
following definition does not depend on the resulting tree. 

Definition 2.5. Let ^ be a graph. Let T be the associated tree with a set of distinguished pairs of leaves 
{(ej,ei)}. We define the phylogenetic semigroup on Q as 

T(g) =T(r)nf|Ker(ei*-e7*). 

i 

In other words, the labelling on the Cj is identical to one on ej, and thus the labelling descents to a 
labelling of Q. Thus t{Q) is canonically embedded in M3'^{Q). 

We identify paths and networks in Q as in Definition 12.11 with elements of the lattice M and replace 
union in 8 with sum in the group M C "LE. Under this identification, the networks correspond precisely 
to the degree one elements in t{Q). More precisely, we define: 

Definition 2.6. A network in the graded lattice Af^^' is a pair uj = (l,a) € M^*" where a € M is a 
network. 

3 The upper bound 

The goal of this section is to prove Theorem 11.11 We proceed in three steps. First we recall the result 
of [DBMIO, Prop. 3.12] that gives Theorem 11.11 in case 5 = 0: if T is a tree, the phylogenetic polytope 
P{T) is normal, meaning that any lattice point in the rescaling nP can be obtained as sum of n lattice 
points in P (usually not in a unique way). This implies that the semigroup t(T) is generated by r(T)i. 

Corollary 3.1. Let T he a tree. Every oj G T{T)d can be expressed as u = loi + ■ ■ ■ + cod, where each 
UJi G t(T)i is a network. 

In the second step, we represent a graph G with first Betti number g as a tree T together with g 
distinguished pairs of leaf edges, that are "glued" together. For an element u G t{Q) we consider the 
decomposition of the corresponding element in r(T) into a sum of degree 1 elements of t{T). To each 
such decomposition we assign a matrix with entries in {—1, 0, 1}. Since the decomposition is not unique, 
we study how simple modifications of the decomposition affect the matrix. Finally, we apply a sequence of 
these modifications to the matrix to prove that any sufficiently high degree element of t{Q) decomposes. 

3.1 Matrix associated to a decomposition of a lifted element 

Let ^ be a graph with first Betti number g and T the associated tree with g distinguished pairs of leaf 
edges. There is a one-to-one correspondence between elements of t{Q) and the elements of t(T) that 
assign the same value to the leaf edges in each distinguished pair. Thus we have the natural inclusion 
t{Q) C t{T). See |Buczl21 §2.2-2.3] for a more geometric interpretation of this inclusion. 

Let (jo be an element of t{Q). By Corollary 13.1} in the semigroup t(7~) there exists a decomposition 
u = ui + ■ ■ ■ + Wdeg(tj)) where Ui G t(T)i. Let = (wi, . . . ,Wdeg(a;)) ^"^^ consider the matrix i?Q with 
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deg(t<;) rows and g columns indexed by pairs of distinguished leaf edges. The entry in the i-th row and 
column indexed by a pair of distinguished leaf edges (e, e) is e*{uji) — e*{uji). Thus, since oji is a network 
e*{uji) G {0, 1} for any edge, entries of Bq are only —1, or 1. 

The matrix depends on the tree T and on the decomposition of u into the sum of degree one 
elements. An entry of Bq is zero when the corresponding network is compatible on the corresponding 
distinguished pair of leaf edges. Our aim is to decompose any element uj with deg((^) > g + 1 in t(Q). 
This means that we are looking for decompositions in r(T) that are compatible on the distinguished pairs 
of leaf edges. Hence, it is natural to consider matrices with as many zero entries as possible. 

Lemma 3.2. Let u be an element of t{T). Let uj = ui + ■ ■ ■ + Wdcg(w) be a decomposition of uj into 
networks. Let B^ he the matrix with deg(cL!) rows corresponding to the decomposition. For any subset of 
indices {ji, . . . , jp} C {1, . . . , deg(a;)} the following conditions are equivalent: 

(i) the element ujj^ + • • • + ujj^ is in t{Q); 

(a) in each column of B^i the sum of entries in rows ji,. . . ,jp is equal to zero. 

□ 

Even if we start from a decomposable uj the associated matrix might not have this property; it 
depends upon the choice of decomposition of oj in t{T). The following lemma shows how to change this 
decomposition in order to obtain a matrix with the required property. 

Lemma 3.3. Let u he an element of t{T). Let us choose a decomposition of uj that gives a matrix Bq 
with as many zeros as possible. Let us choose two entries in the matrix Bq that are in the same column 
indexed by {ei,ei). Suppose they are equal respectively, to 1 and —1. There exists a decomposition i}' of 
u) that yields a matrix Bqi the same as Bq, except for those two entries, which are interchanged. 

Proof. Let uj = 0Ji + ■ ■ ■ + a'deg(a;) be the given decomposition. Without loss of generality we may assume 
that the entries are in the first and second row. Hence uji associates to the edges and ei values and 
1 respectively, and similarly uj2 associates 1 and 0. 

To facilitate modifications of networks, we introduce the group of networks, following [Michllbl 
Def. 4.1]. The elements are networks, and the group addition is modulo 2, that is an edge is in the 
sum if and only if it is in exactly one of the summands. Formally: 

Notation 3.4. The group of networks is the subset of 

:= -e 

see 

such that a formal sum ei + 62 + • • • + £ Z2£^ is in the group of networks if and only if {ei, 62, . . . , e^} 
is a network. Note that this subset forms a subgroup of 'Ii2£- 

Let S be the set of all edges of the tree T on which the networks uji and uj2 disagree. S is a network 
and in fact S = uji + UJ2 (sum in the group of networks). Later we will replace S with other networks. 

Our aim is to construct a network b C S which realises the swapping of entries in the following sense. 
For networks uj[ = wi + 6 and UJ2 = uj2 + b (the sums in the group of networks), the new factorisation 
given by UJ = uj'i + uj'2 + UJ3 + ■■■ + ti'dcg(aj) (the sum is in t{T)) interchanges the two entries as desired. 
The network b will consist of paths {pi ,p2,P3, . . .), which we construct inductively. Define pi to be any 
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path contained in S starting at ei. It is possible as all inner vertices are adjacent to an even number of 
edges from S. Next, we replace S hy S + pi (sum in the group of networks). 

Suppose that we have constructed a sequence of paths pi, ■ ■ ■ ,Pm~i for m > 1, where the first edge of 
Pi is ei, the last is ej_,_x, and (cj, Cj) is a distinguished pair for alH G {1, . . . , m — 1}. After each inductive 
step, if pm is constructed we will replace S hy S + Pm, where the sum is taken in the group of networks. 



(i) If the edge is not paired, stop the construction. Otherwise go to Case (ii) 



(ii) If there is a distinguished pair (e^,em) and e5^(wi) / e*^{LOi) or e*^{uj2) 7^ e^('^2); i-e. at least 
one of the two entries in the column (e^,em) is non-zero, stop the construction. Otherwise go to 



Case 



m 



(iii) If there is a distinguished pair (e^,em) and a^{uji) = e^(a;i), e,m('^2) = e^(^2), then uji and 002 
disagree on em- Note that em is in S. Indeed, otherwise em would belong to some pi for i < m. As 
we have reached e„ by edges not belonging to any pi we must have = If this was true, we 



would have been in Case (ii) and the construction would have terminated. 

We define pm to be a path contained in 5 starting from em- Let em+i be the other end of the path 
Pm- We increase m by 1 and replace S by S + Pm, where the sum is taken in the group of networks. 



We start over from Case (i) 



Let us notice that the constructed paths are distinct, as each time we remove the edges of paths from 
S. In particular, the construction terminates. 

We define a network 6 C 5 to be the union of paths (pi, . . . ,Pm-i)- We use it to define two new 
networks u[ and UJ2- Namely, = Ui + b, where the sum is taken in the group of networks. In other 
words, u}[ (resp. uj'2) coincides with wi (resp. UJ2) on all edges apart from those belonging to the network 
b. On the latter ones uj[ (resp. ^2) is a negation of wi (resp. 0^2), hence coincides with uj2 (resp. wi). In 
particular, uji + uj2 = u}[ + uj'2, where this time the sum is taken in t{T). 

We get a decomposition Q.' = (w^, a;2, ws, . . . , ijOdcg(u])) with cj = ^ fi' and the associated matrix Bq,i. 
We claim that it exchanges the two chosen entries equal to 1 and —1. 

Consider each distinguished pair of leaf edges through which we passed during our construction of 
(pi, . . . ,pm-i)- If we did not stop at a pair (Zi, I2) each network oji and a;2 assigns the same value to li 



and I2 — otherwise we would have stopped because of Case (ii) , On these leaf edges to'^ and oj'2 agree 
with UJ2 and uji respectively. Hence, they also assign the same value to li and I2- In particular, both Bq, 
and B^i have zeros in the first two rows in the column indexed by {li,l2)- In fact, the only four entries 
on which Bq^ and Bq_i might possibly differ are the entries in first two rows in the columns indexed by 
(§1,61) or (e„,em), where is the last path. 



Suppose the construction stopped in (i) Then the last leaf edge is not paired, hence we change only 
entries in the column indexed by {e^ , ei ) . Since both u'^ and ^2 agree on e^ and ei , we have that B^i has 
two zeros, whereas i?n had 1 and —1. This contradicts the assumption that i?n has as many zeroes as 
possible. 



Now suppose the construction terminated in Case (ii) , Consider two sub-cases. 

1) The edges 7^ ei are distinct. We exclude this case. We change four entries in two columns. The 
two entries in the column indexed by (6^,61) are changed from 1 and —1 to zero. We know that matrix 
Bq^i has at most as many zero entries as Bq^. Hence the two entries in the column indexed by i^e^m^^m) 
must be changed from two zeros to two non-zeros. Having two zeros in Bq^ in those entries contradicts 



the assumptions of Case (ii) 
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2) The edges = are equal. In this case 6^ = 61, so we only exchange two entries in the column 
indexed by {ei,ei). This means that we have exchanged 1 and —1, which proves the lemma. □ 

Proof of Theorem \l. 1[ Consider an element uj of degree deg(a;) > g + 1 in t{G) and a tree T associated 
with the graph Q. Let us choose a decomposition of w in t{T), so that the associated matrix Bq has as 
many zero entries as possible. We find a subset of rows of the matrix Bq such that the sum of entries in 
each column is even as follows. Reduce the entries of Bq modulo 2 obtaining the matrix Cn with entries 
from Z2. We think of rows of C^j as vectors of the (/-dimensional vector space over the field Z2. We have 
deg{uj) > g + 1 such vectors. Hence we can find a strict subset of linearly dependent vectors. As we work 
over Z2 there exists a strict subset of these vectors summing to 0. The same subset R of rows in matrix 
Bq sums to even numbers in each column. 

Since uj G t{G), the sum of entries in each column of the matrix Bq is zero. Suppose the sum of 
entries in the rows from R is non-zero in a column. Using Lemma 13.31 we exchange the entries, changing 
the sum by 2 until it is equal to zero. This way we get a decomposition Q' of lo such that the rows from 
R sum to zero in each column. By Lemma 13.21 the sum of networks corresponding to rows from R is in 
t{Q). The sum of the remaining networks is also in t{Q). We have obtained a non-trivial decomposition 
of w. □ 



4 The upper bound is sharp for even g 

In this section we show that if g is even, the bound + 1 is sharp for a caterpillar graph with g loops. 
More generally to construct high degree indecomposable elements it suffices to consider trivalent graphs. 

Lemma 4.1. Let Q be a graph with first Betti number g and phylogenetic semigroup generated in degree 
n. There exists a trivalent graph Q' with first Betti number g and phylogenetic semigroup generated in 
degree > n. 

Proof. We construct G' from Q. Choose an inner vertex v oi G that is not trivalent. Replace v by v' and 
v" together with a new edge between them, let 2 edges incident to v be incident to v' and the rest of the 
edges incident to v be incident to v". After a finite number of replacements we get a trivalent graph Q', 
because valency{v') < valency{v) and valency{v") < valency{v). 

Now consider a tree T with g distinguished pairs of leaf edges associated to G that is attained by 
dividing edges ei, 62, . . . , into two. Dividing exactly the same edges ei, 62, . . . , into two in G' gives 
a tree T' with g distinguished pairs of leaf edges associated to G' ■ As t{'T) and t(T') are normal, the 
semigroup t(T) is a coordinate projection of the semigroup r(T') that forgets coordinates corresponding 
to new edges. Hence the semigroup t{G) is a coordinate projection of the semigroup t{G') and projections 
of generators of t{G') generate t(G)- □ 

4.1 Trivalent graphs 

We introduce notation and definitions specific to trivalent graphs useful for constructing high degree 
indecomposable elements. 

Notation 4.2. We denote the elements of the lattice dual to the edges meeting at the inner vertex v 

:= {iy{ei)y, by := (it,(e2))*, Cy := {ivies))*, 
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where {ei, 62, 63} are the edges of A, and : ^ Q is a map which is locaUy an embedding and sends 
the central vertex of A to u (see Figures [2] and [5]) . 

Given an element to in either TjS, M, or M^^', each of a^, by, € measures the coefficient of ui at 
an edge incident to v. 

Definition 4.3. The degree of a; £ M^' at an inner vertex v ^ J\f is 

deg„(w) := ^ • (a^iuj) + K{uj) + c^iuj)) 
(see Figure [3] for an illustrative example). 




Figure 2: A graph with four vertices t, u, v, w 
with a^,b^, c* indicated for each vertex. 




Figure 3: An element u G t{Q) with deg(a;) = 
4, together with values of degrees at each in- 
ner vertex. Thus at{uj) = ht{ui) = 3, q(cj) = 
6u(a;) = 2, etc. 



Following |Buczl2t Def. 2.18 & Lem. 2.23] we give the inequality description of phylogenetic semigroups 
for trivalent graphs. 

Lemma 4.4. For a trivalent graph Q the phylogenetic semigroup t{Q) on Q is the set of elements u 
satisfying the following conditions 

[W] parity condition: uj € M^*", 

[+] non-negativity condition: e*(w) > for any e G £, 

[A] triangle inequalities: |a„(f^) — by{oj)\ < c^(a;) < av{uj) + by{uj), for each inner vertex v € A^, 

[°] degree inequalities: deg(a;) > deg^(a;) for any v G M. 



The triangle inequalities [A] are symmetric and do not depend on the embedding i^. 

Remark 4.5. If every edge of Q contains at least one inner vertex, then the inequalities above imply 
deg{uj) > e*{u}) for all edges. On the other hand, in the degenerate cases where one of the connected 
components of G consists of one edge only, for consistency the inequality deg(w) > e*{uj) should be 
included in Lemma |4.4[ However, we will not consider these degenerate cases here. 
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4.2 Loops, caterpillar graphs, and local paths 



Assume Q is trivalent. We investigate the influence of loops in the graph Q on the semigroup t{Q), 
particularly on the parity condition. Then we define the ^-caterpillar graph and apply the conditions 
coming from loops to this case. Finally, we define an element of the phylogenetic semigroup and we prove 
it is indecomposable. 

Example 4.6. Let o € be a loop with unique vertex Vo € o. There is exactly one edge Co, other than 
o, such that Vq € Co- Loops force parity of the label on Cq, that is if cj € M then e*{u}) is even. It is a 
straightforward consequence of the parity condition or the definition of M in the neighbourhood of 

Vo- 



Figure 4: The g-caterpillar graph. 

The trivalent graph obtained from the caterpillar tree with g + 1 leaves by attaching a loop to all but one 
leaf (the leftmost one) is called the gr-caterpillar graph, see Figured! 

Example 4.7. Let Q be the (^-caterpillar graph and co G Z,£. The parity condition | [ W] | on the g(-caterpillar 
graph can be seen as a requirement of parity at each edge which is not a loop. That is a; G M if and only 
if e*{uj) is even on every edge e other than loops. 

The conditions defining t{Q) imply that every element uj G t{Q) decomposes locally in a unique way 
into paths around any vertex. This means that there exist non- negative integers related to 

a^,,6^,,c^, as in Figure [5] such that deg(w) > Xy + yv + (see |Buczl2t Sect. 2.4] for more details). In 
the case of the g'-caterpillar graph we denote the local paths at an inner vertex v on the horizontal line 
straight (z^), left (y^) and right (x^) paths, see Figure [H A consequence of Example 14.71 in terms of the 
local paths is the following. 




o 



Figure 5: The decomposition into local paths at any Figure 6: Notation for local paths on a 

vertex. vertex in a ^-caterpillar graph. 

Corollary 4.8. Let G be a g-caterpillar graph, uj G t{Q), v a vertex not on a loop. Then 
• if deg^, CO is even, Xy{uj),yv{uj), Zy^uj) are all even. 
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• ifdeg^LO is odd, Xy{uj),yy{uj), z,u(uj) are all odd. 
In particular, deg^(cL!) 7^ 1. 

Example 4.9. Suppose g = 2k is even, and let G be the g'-caterpillar graph. The element uj defined on 
Figure [7] is indecomposable. 

2k 2 2k -2 A 2 2k^ , 

1 1 1 • • • 1— — O k 

2k\ I ^ 



k k k k 

Figure 7: The indecomposable element oj of degree g + 1 on the ^(-caterpillar graph for even g. 

Proof. We begin the proof by explaining the local decomposition of oj. Starting from the left-most inner 
vertex of the caterpillar tree we have 



(1) 2k — 1 left, 1 right, 1 straight paths 



(2) 2k -2 right, 2 left paths 



(3) 2k — 3 left, 3 right, 1 straight paths 



2k -2i 







=^ 
=^ 




^ f 




^ 'a" 














41 


• • • • a 





{2k - 1) 1 left, 2k -1 right, 1 straight paths 



Suppose for a contradiction that co is decomposable as uj' + cj". Since the degree of w is odd, one of 
the two parts has even degree. Assume u' has even degree deg(w') = 2i with i > 0. 

Every second vertex v on the horizontal line has a single straight line in the local decomposition of to. 
Moreover at such v the degree is attained deg^ a; = degw. Thus deg^^w' = deguj' and deg^w" = degw" 
as well. By Corollary 14.81 the local decomposition of w" at v consists of the single straight path and odd 
number of left paths and odd number of right paths, whereas the local decomposition of uj' at v consists 
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of even number of left paths and even number of right paths. This means w' must have 2i left paths at 
the left-most inner vertex on the horizontal line of Q. At the next inner vertex on the horizontal line, uj' 
has 2i right paths by Example 14.71 so on. This is a contradiction, as at some inner vertex on the 
horizontal line uj has less than 2i left paths. □ 

5 A lower bound for odd g 

If g is odd, there exist graphs with first Betti number g with minimal generators of t{Q) in degree g. They 
are obtained by extending the labelling from Example 14.91 to the extra loop of the {g + l)-caterpillar. We 
do not know if the maximal generating degree is g or g -\- 1 among the graphs with first Betti number g. 
However, we know it for the g(-caterpillar graph. 

Lemma 5.1. Let Q he the g-caterpillar graph. Let uj G t[Q) he an element of even degree at least 6. Then 
UJ can he decomposed into degree 2 and deg{uj) — 2 elements. 

Proof. For this proof we fix the following notation. At each vertex v on the horizontal line of the g- 
caterpillar, we choose an embedding of the tripod so that a„, h^ and Cy are arranged as in Figure O so 
is the value on the vertical edge, on the left one, hy on the right one. 

Let d := deg(a;) be the degree of u. We will define a degree 2 element uj', so that uj = uj' + uj" is a 
decomposition in t[Q). In our construction we use local paths. This assures that the resulting uj' and uj" 
fulfill the triangle inequalities A] of r(Cy). To assure the degree inequalities [[°]| we require that uj' satisfies 



the following at each inner vertex v 

d-2> deg,{uj") = deg,{uj) - deg,{uj'). (5.2) 

Note that if deg^(a;') = 2, or equivalently, if uj' is constructed using two local paths at v, then ()5.2p is 
automatically fulfilled. 

First we define the labels of uj' on the caterpillar tree, ignoring the labels on the loops for a while. We 
define them inductively from left to right using local paths, in such a way that the following condition 
holds for every inner vertex v of the caterpillar tree 



b,{uj') = 

First we define uj' for the left-most edge e 

e*{uj') 



if6„(a;)<f, 

2 if6„(w)>f, (5.3) 

or 2 otherwise. 



ife*(w)<f, 
2 otherwise. 



We need to prove that at every step there is enough of local paths in uj to fulfill conditions (|5.2p and (|5.3p . 
There are six cases depending on the value of uj' on the previous edge and the value of uj on the current 
one. 

(i) If ay{uj') = 2 and b^{uj) > d/2, then we have to prove that uj has at least two straight paths at v, 
since we need h^iuj') = 2. The condition (j5.3p gives a^(ix)) > d/2, and 

a^{uj) + h^{uj)-c^{uj) d Cy{uj) 
#straight = Zy{uj) = > — > 0, 
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where the last inequality holds because of 



2d > av{uj) + bv{io) + Cv{u}) > - + ^ + Cv{uj) = d + Cv{uj). 

As d and 0^,(0;) are both even, we conclude that uj has at least two straight paths at v. 

(ii) If a„(a;') = 2 and 5t,(a;) = d/2, then we have to prove that uj has either at least two straight paths 
or at least two left paths at v, since we need either bv{u}') = 2 or h^ioj') = 0. The condition (j5.3p 
gives a^(a;) > d/2, and 

^straight + ^^left = z^{ijj) + yv{oj) = a^(cj) > ^ > 3. 

(iii) If ay[ixj') = 2 and < d/2, then we have to prove that u has at least two left paths at v, since 
we need 6^(0;') = 0. The condition (|5.3|) gives a„(a;) > d/2, and thus a^(a;) — hv{uj) > 0. By the 
triangle inequalities [A] we have 

#left = yy{uj) = ^ — > — — + - — ^-^^ = ay{u}) - bv{uj) > 1 

As 0^,(0;) and 6^(0;) are both even, we conclude that u has at least two left paths at v. 

(iv) If ay{uj') = and b^j^uj) > d/2, then we have to prove that u has at least two right paths at v, since 
we need by{uj') = 2. The condition (15. 3p gives a^(u;) < d/2, and thus 6^(0;) — 0^,(0;) > 0. Again, by 
the triangle inequalities [A] we have 

#right = Xy{uj) = > \ = by{uj)-ay{uj) > 1 

(v) If a„(a;') = and by{u}) = d/2, we have to prove that either degy{uj) < d — 2 or u has two right 
paths at V, since we need either by{u}') = or by{u}') = 2. If deg^(a;) > d — 1, using the condition 
(|5.3p gives 

#right = Xy{u;) = = deg^(a;) - a„(w) > (d - 1) - - > 2. 

(vi) If ay{uj') = and by{oj) < d/2, then we have to prove that deg^(a;) < d — 2, since we need by{uj') = 0. 
The condition (15. 3p gives 0^,(0;) < d/2, and thus 0^,(0;) + by{uj) < d — 1. As a^,(u;) and by{uj) are 
both even, we even have ay{uj) + by{u}) < d — 2. Using this and the triangle inequalities [A] we get 
the desired inequality: 

2deg„(w) = ay{(jj) + by{(jj) + Cy{oj) < d — 2 + Cy{uj) < d — 2 + ay{uj) + by{uj) < 2d — 4. 



Note that we use d > 6 only in cases with b = d/2, i.e., cases (ii) and (v) ). 

It remains to suitably define the labels of lo' on the loops. Fix a loop o. In the local decomposition of 
UJ at the vertex Vq some of the local paths come in pairs: There are e*{uj)/2 loops with 2 on the adjacent 
edge and 1 on the loop; there are {o*{uj) — e*(a;)/2) single loops with on the adjacent edge and 1 on the 
loop. 
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If e*(a;') = 2 then e*(a;) > 2, and there is at least one loop with 2 on the adjacent edge in the local 
decomposition of co. Set o*{u)') = 1. 

Otherwise e*{uj') = by the construction above. This implies together with the Remark 14.51 that 
e*{uj) < d — 2. Hence the number of single loops 

{o* - I) (u) = deg^^u) - eliu) > deg,„(a;) -d + 2, 

and we define 

o*{uj') = max{deg^_^ (w) -d + 2,0}. 
Finally we check that the condition (|5.2p is fulfilled. 

deg^„(w) - deg^„(w') < deg^„(w) " (deg^M -d + 2)<d-2. 
This completes the proof. □ 



6 Examples on small graphs 

We conclude the article with some examples of indecomposable elements for special cases of graphs with 
small first Betti number g. 



2 2 4 
1 1 



02 




4 4 




2 2 



^ |0 - 2^ 

1 eAe 



2 4 



0.0 




Figure 8: An indecompos- 
able element of degree 4 on 
the 3-caterpillar graph. 



Figure 9: An indecompos- 
able element of degree 6 on a 
graph with 6 loops and two 
leaves. 



Figure 10: An indecomposable 
element as in Figure [8] adapted 
to a graph with no loops and 
with vertices of high valency. 



The example on Figure [8] is an indecomposable element of degree 4 on the 3-caterpillar graph. It 
shows that our bound d > 6 in Theorem 11.41 is necessary, and also proves that, in the case (7 = 8, the 
upper bound of Theorem 11.11 is attained. 

On Figure [9] there is a degree 6 indecomposable element on a graph with 6 loops and one leaf. This 
shows that our decomposition Theorem 11.41 does not work on this non-caterpillar graph. 

Despite our examples in Sections H] and [6] are indecomposable elements on trivalent graphs that contain 
loops, it is possible to slightly modify those examples to graphs with no loops and to graphs of higher 
valency. This is provided by the following two elementary properties, and an example how to apply them 
is on Figure [lOl 

Proposition 6.1. Suppose e is an edge of G, and uj € t{Q) is such that e*{uj) = 0. Let Q' he the graph 
obtained from Q by removing the edge e and let uj' € t{Q') he the labelling identical with uj away from e. 
Then uj is indecomposable in t{Q) if and only if uj' is indecomposable in t{Q'). 
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9 


= 1 




d 


generator 


# 


1 


(0) 


2 


2 


(2) 


1 



a 


= 2 




d 


generator 


# 


1 


(0,0,0) 


4 


2 


(0, 2, 2) 


1 


2 


(2,0,2) 


3 


2 


(2, 2,0) 


3 


3 


(2, 2,2) 


4 



9 


= 3 




d 


generator 


# 


1 


(0,0,0,0,0) 


8 


2 


(0,0,0,2,2) 


3 


2 


(0,2,2,0,2) 


3 


2 


(0,2,2,2,0) 


3 


2 


(2,0,2,0,2) 


9 


2 


(2,0,2,2,0) 


9 


2 


(2,2,0,0,0) 


9 


2 


(2,2,0,2,2) 


1 


3 


(0,2,2,2,2) 


8 


3 


(2,0,2,2,2) 


16 


3 


(2,2,2,0,2) 


16 


3 


(2,2,2,2,0) 


16 


3 


(2,2,2,2,2) 


8 


4 


(2,2,2,2,4) 


9 


4 


(2,2,2,4,2) 


9 


4 


(2,4,2,2,2) 


9 


4 


(4,2,2,2,2) 


27 



9 


= 4 and d < 2 




d 


generator 


# 


1 


(0,0,0,0,0,0,0) 


16 


2 


(0,0,0,0,0,2,2) 


9 


2 


(0,0,0,2,2,0,2) 


9 


2 


(0,0,0,2,2,2,0) 


9 


2 


(0,2,2,0,2,0,2) 


9 


2 


(0,2,2,0,2,2,0) 


9 


2 


(0,2,2,2,0,0,0) 


9 


2 


(0,2,2,2,0,2,2) 


1 


2 


(2,0,2,0,2,0,2) 


27 


2 


(2,0,2,0,2,2,0) 


27 


2 


(2,0,2,2,0,0,0) 


27 


2 


(2,0,2,2,0,2,2) 


3 


2 


(2,2,0,0,0,0,0) 


27 


2 


(2,2,0,0,0,2,2) 


3 


2 


(2,2,0,2,2,0,2) 


3 


2 


(2,2,0,2,2,2,0) 


3 




9 


= 4 and d = 3 






generator 


# 


3 


(0,0,0,2,2,2,2) 


32 


3 


(0,2,2,0,2,2,2) 


32 


3 


(0,2,2,2,2,0,2) 


32 


3 


(0,2,2,2,2,2,0) 


32 


3 


(0,2,2,2,2,2,2) 


16 


3 


(2,0,2,0,2,2,2) 


64 


3 


(2,0,2,2,2,0,2) 


64 


3 


(2,0,2,2,2,2,0) 


64 


3 


(2,0,2,2,2,2,2) 


32 


3 


(2,2,0,2,2,2,2) 


16 


3 


(2,2,2,0,2,0,2) 


64 


3 


(2,2,2,0,2,2,0) 


64 


3 


(2,2,2,0,2,2,2) 


32 


3 


(2,2,2,2,0,0,0) 


64 


3 


(2,2,2,2,0,2,2) 


16 


3 


(2,2,2,2,2,0,2) 


32 


3 


(2,2,2,2,2,2,0) 


32 


3 


(2,2,2,2,2,2,2) 


16 



9 


= 4, d 




4 




d 


generator 


# 


4 


(0,2 


2 


2,2,2,4) 


27 


4 


(0,2 


2 


2,2,4,2) 


27 


4 


(0,2 


2 


4,2,2,2) 


27 


4 


(0,4 


4 


2,2,2,2) 


27 


4 


(2,0 


2 


2,2,2,4) 


45 


4 


(2,0 


2 


2,2,4,2) 


45 


4 


(2,0 


2 


4,2,2,2) 


45 


4 


(2,2 


2 


0,2,2,4) 


45 


4 


(2,2 


2 


0,2,4,2) 


45 


4 


(2,2 


2 


2,0,4,4) 


9 


4 


(2,2 


2 


2,2,2,4) 


27 


4 


(2,2 


2 


2,2,4,2) 


27 


4 


(2,2 


2 


2,4,0,4) 


45 


4 


(2,2 


2 


2,4,2,2) 


81 


4 


(2,2 


2 


2,4,4,0) 


45 


4 


(2,2 


2 


4,2,0,2) 


45 


4 


(2,2 


2 


4,2,2,0) 


45 


4 


(2,2 


2 


4,2,2,2) 


27 


4 


(2,2 


2 


4,2,2,4) 


9 


4 


(2,2 


2 


4,2,4,2) 


9 


4 


(2,2 


4 


2,2,2,2) 


81 


4 


(2,4 


2 


0,2,2,2) 


45 


4 


(2,4 


2 


2,2,0,2) 


45 


4 


(2,4 


2 


2,2,2,0) 


45 


4 


(2,4 


2 


2,2,2,2) 


27 


4 


(2,4 


2 


2,2,2,4) 


9 


4 


(2,4 


2 


2,2,4,2) 


9 


4 


(2,4 


2 


4,2,2,2) 


9 


4 


(4,0 


4 


2,2,2,2) 


135 


4 


(4,2 


2 


0,2,2,2) 


135 


4 


(4,2 


2 


2,2,0,2) 


135 


4 


(4,2 


2 


2,2,2,0) 


135 


4 


(4,2 


2 


2,2,2,2) 


81 


4 


(4,2 


2 


2,2,2,4) 


27 


4 


(4,2 


2 


2,2,4,2) 


27 


4 


(4,2 


2 


4,2,2,2) 


27 


4 


(4,4 





2,2,2,2) 


27 



9 


= 4 and d = 5 




d 


generator 


# 


5 


(2,2,2,4,2,4,4) 


32 


5 


(4,4,2,4,2,2,2) 


64 


5 


(4,4,2,4,2,4,4) 


16 



Table 1: Generators of phylogenetic semigroup of g-caterpilar graph. 



Proposition 6.2. Suppose v is a two-valent vertex of Q and let ei and 62 he the two edges containing 
V. Then for any to € t{Q) we have ej((^) = 62(6^;). Furthermore, t{Q) is naturally isomorphic to t{Q'), 
where Q' is the graph obtained by removing v from Q and replacing ei and 62 with a single edge e. 

For g < A Table [U lists all generators of t{Q) by specifying the possible labellings on all edges except 
for the loops. The order of edges goes from left to right, beginning with the leaf, the second is the vertical 
edge towards the first loop, the third is the next horizontal edge, etc. For instance, the example of Figure[8] 
is encoded (2, 2, 2, 2, 4) and can be found in the table for = 3 in the 14*^ row. The label on each loop 
can be set to any integer in the range {^Ct,, . . . ,d — ^c^,}. In the third column we specify how many 
possibilities there are for the labelling on the loops. An analogous table for g = 5 would need 359 rows, 
thus we omit it from this article. 

Table [2] presents the numbers of generators of t{Q) in each degree, where Q is the (^-caterpillar graph, 
and g < 5. These calculations were obtained using the convex bodies package in Magma [BCP97] . [BBK] . 
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(/ 


.y = 1 


.y = 2 


= -i 




(7 = 5 


all 


3 


15 


163 


2708 


49187 


1 


2 


4 


8 


16 


32 


2 


1 


7 


37 


175 


781 


3 




4 


64 


704 


6624 


4 






54 


1701 


35190 


5 








112 


6560 



Table 2: Number of generators of the phylogenetic semigroup of ^-caterpillar graph in each degree. 
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